High-dimensional data visualization by interactive construction of low-dimensional parallel coordinate plots
نویسندگان
چکیده
Parallel coordinate plots (PCPs) are among the most useful techniques for the visualization and exploration of high-dimensional data spaces. They are especially useful for the representation of correlations among the dimensions, which identify relationships and interdependencies between variables. However, within these high-dimensional spaces, PCPs face difficulties in displaying the correlation between combinations of dimensions and generally require additional display space as the number of dimensions increases. In this paper, we present a new technique for high-dimensional data visualization in which a set of lowdimensional PCPs are interactively constructed by sampling user-selected subsets of the high-dimensional data space. In our technique, we first construct a graph visualization of sets of well-correlated dimensions. Users observe this graph and are able to interactively select the dimensions by sampling from its cliques, thereby dynamically specifying the most relevant lower dimensional data to be used for the construction of focused PCPs. Our interactive sampling overcomes the shortcomings of the PCPs by enabling the visualization of the most meaningful dimensions (i.e., the most relevant information) from highdimensional spaces. We demonstrate the effectiveness of our technique through Email addresses: [email protected] (Takayuki Itoh), [email protected] (Ashnil Kumar), [email protected] (Karsten Klein), [email protected] (Jinman Kim) Preprint submitted to Elsevier September 20, 2016 two case studies, where we show that the proposed interactive low-dimensional space constructions were pivotal for visualizing the high-dimensional data and discovering new patterns.
منابع مشابه
VizCluster and Its Application on Clustering Gene Expression Data
Visualization enables us to find structures, features, patterns and relationships in a dataset by presenting the data in various graphical forms with possible interactions. A visualization can provide a qualitative overview of large and complex datasets, can summarize data, and can assist in identifying regions of interest and appropriate parameters focused on quantitative analysis. Recent deve...
متن کاملVisual Clustering and Classiication: the Oron- Say Particle Size Data Set Revisited 2.2 Parallel Coordinate Plots 2.3 Linked Low{dimensional Views 3 the Oronsay Particle Size Data Set
Interactive statistical graphics can be eeectively used to nd natural groupings in observations. In this paper we want to demonstrate how clustering and classiication can be done with three approaches based on highly interactive graphical environments: high{dimensional scatterplots as available in XGobi, parallel coordinate plots as available in ExplorN, and linked low{ dimensional views as ava...
متن کاملHDDVis: An Interactive Tool for High Dimensional Data Visualization
Current high dimensional data visualization tools are not effective in helping users gain insights of the data for many domains and tasks. This paper presents HDDVis, a very accessible High Dimensional Data Visualization tool that allows user to interactively explore the datasets from both low-dimensional projections and parallel coordinates. The dimensionality reduction methods used by HDDVis ...
متن کاملThe parallel coordinate plot in action : design and use for geographic visualization 3
7 Implementations of interactive parallel coordinate plots in geographic visualization systems are presented. The plots represent spatial and spatio-temporal data, and are linked to maps and 9 scatterplots. The interactive features of the parallel coordinate representations are discussed, with particular emphasis on their ability to facilitate geographic data exploration and understand11 ing. T...
متن کاملThe parallel coordinate plot in action: design and use for geographic visualization
Implementations of interactive parallel coordinate plots in geographic visualization systems are presented. The plots represent spatial and spatio-temporal data, and are linked to maps and scatterplots. The interactive features of the parallel coordinate representations are discussed, with particular emphasis on their ability to facilitate geographic data exploration and understanding. The pape...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- J. Vis. Lang. Comput.
دوره 43 شماره
صفحات -
تاریخ انتشار 2017